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Abstract 

In this paper, we propose a disambiguating technique called controlled 
disjunctions. This extension of the so-called named disjunctions relies on 
the relations existing between feature values (covariation, control, etc.). 
We show that controlled disjunctions can implement different kind of am- 
biguities in a consistent and homogeneous way. We describe the integra- 
tion of controlled disjunctions into a HPSG feature structure representa- 
tion. Finally, we present a direct implementation by means of delayed 
evaluation and we develop an example within the functionnal program- 
ming paradigm. 

1 Introduction 

Ambiguity can affect natural language processing at very different levels: it 
can be very local (e.g. limited to a feature value) or conversely affect entire 
syntactic structures. But the general disambiguating process remains the same 
and rely in particular on contextual information. Unfortunately, there exists 
very few solutions providing a general account of such a process with a direct 
and efficient implementation. 

In this paper, we propose an approach allowing a general and homogeneous 
representation of ambiguity and disambiguation relations. This approach con- 



stitutes an extension of named disjunctions (cf. Dorrc9C | ) and allows a direct 
implementation of relations controlling the disambiguation. 

This paper is threefold. In the first part, we approach the question of the 
representation and we situate our method among the most representative ones. 
We describe in particular the advantages and drawbacks of named disjunctions 
and show how different phenomena can be described within such a paradigm. 
In the second part, we propose an analysis of the disambiguating process it- 
self and describe some of the control relations existing between the different 
parts of the linguistic structure. We integrate the representation of such rela- 
tions to the named disjunction approach: we call this new technique controlled 



1 



disjunction. The last section presents the implementation of controlled dis- 
junction which uses different delayed evaluation techniques such as coroutining, 
constraint propagation or rcsiduation. 



2 Representing Ambiguity 

Ambiguity is generally a problem for NLP, but it can also be conceived as a 
useful device for the representation of particular linguistic information such as 
homonymy, valency variations, lexical rules, etc. In this perspective, a general 
representation is very useful. 



2.1 Different needs for representing ambiguity 

Ambiguity can be more or less complex according to its scope and there is a 
general distinction between global and local ambiguities. We can also observe 
such a difference from a strictly structural point of view: ambiguity can affect 
the subcomponents of linguistic objects as well as entire structures (or, in a 
feature structure perspective, atomic as well as complex feature values). This is 
the reason why we distinguish two fundamental properties: (i) the interconnec- 
tion between different subcomponents of an ambiguous structure and (m) the 
redundancy of such structures. 

Moreover, we can introduce a certain kind of dynamicity: ambiguity affects 
the objects differently depending on whether it is related to external values 
or not. More precisely, certain ambiguities have only a minimal effect on the 
structure to which they belong whereas some others can deeply affect it. A 
value can be poorly or strongly related to the context and the ambiguities are 
more or less dynamic according to the density of the entailed relations. 



Figure (2.1) shows several ambiguities involving different kind of relations 
between feature values. In these examples, some ambiguities (e.g. (l.a)) have no 
effect on the other features whereas some features are strongly interconnected 
as in (l.c). Let us remark that the feature type has no consequence on such 
relations (see for example (l.d) and (l.c)). 

The second problem concerns redundancy: when an ambiguity affects major 
feature values such as category, the result is a set of structures as for (l.b): 
the ambiguity between the verb and the noun involves in this example two 
completely different linguistic structures. But there are also ambiguities less 
interconnected with other parts of the structure (e.g. (l.d)). In these cases, a 
common subpart of the structure can be factorised. This is particularly useful 
for the representation and the implementation of some descriptive tools such as 



lexical rules (see | Brcdckamp96 



An efficient ambiguity representation must take into account the intercon- 
nection between the different subparts of the structure and allow a factorisation 
avoiding redundancy 
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Figure 1: Examples of interconnection between feature values 



2.2 Different Representations 



The disjunctive representation of ambiguity remains the most natural (see [ Karttunen984 
[Kay85 or [ Kasper90| ). However, this approach has several drawbacks. First 



of all, if an ambiguity affects more than one atomic value, then a classical dis- 
junction can only represent variation between complete structures. In other 
words, such a representation doesn't allow the description of the relations exist- 



ing between the features. In the same way, several approaches (see [\Iaxwell91 
[ Nakazawa88 1 , [ Ramsay9C| , Dawar90 or [ Johnson90| ) propose to rewrite dis 
junctions as conjunctions (and negations). This method, in spite of the fact that 
it can allow efficient implementations of some ambiguities, presents the same 
drawback. 

A partial solution, concerning in particular the redundancy problem, can be 
proposed with the use of named disjunctions (noted hereafter N D; also c alled 
distributed disjunctions). This appro a ch has been de scri bed by |Dorrc90 and 
used in several works (see [ Krieger93 1 , | Gerdcmann95 ] or [ Blachc9(: ] ) . The dis- 
junction here only concerns the variable part of a structure (this allows the 
information factorisation). 

A ND binds several disjunctive formulae with an index (the name of the 
disjunction). These formulae are ordered and have the same arity. The variation 
is controlled by a covariancy mechanism enforcing the simultaneous variation 
of the ND values: when one disjunct in a ND is chosen (i.e. interpreted to 
true), all the disjuncts occurring at the same rank into the other ND formulae 
also have to be true. Several NDs can occur in the same structure. Figure (||) 
presents the lexical entry corresponding to example (l.d). In the following, the 



shaded indices represent the names of the disjunctions. 
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This example shows a particular case of subvariation. Indeed, the plural 
dative determiner stipulates no constraints on the gender. We note this subvari- 
ation with an anonymous variable. 

However, the ND technique also presents some drawbacks. In particular, 
covariancy, which allows a local representation of ambiguity, is the only way 
to represent relations between features. Such structures need redundant infor- 
mation as shown in example (|J) (cf. [ Kriegcr93| ) . Moreover, the semantic of 
the disjunction is lost: this is no longer a representation of a variation between 
values, but between different set of values. 
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In the same perspective, covariancy forces a symmetrical relation between 
two feature values. In fact, there are relations expressing finer selection relations. 
This is the case in (l.c) in which no covariancy between the part-of-specch and 
gender features occurs: an adjective is cither masculine or feminine whereas 
a noun is always masculine. A classical ND representation as in example (||) 
doesn't account for the fact that the noun selects a masculine value for the 
gender feature, but the reverse is not true. 



3 Disambiguating 

Generally, disjunctions are expanded into a disjunctive normal form and the 
disambiguation comes to a well-formedness verification. In case of incoherence, 
backtracking is applied and another model is elaborated (i.e. another value is 



choosen). This method can be improved by some techniques (cf. [Maxwell91]), 
but the basic mechanism remains a kind of generate-and-test device: the first 
feature instantiation leads to the choice of an entire structure which has to be 
validated during the parse. 



3.1 Fundamental Needs 



A natural solution consists in delaying the evaluation of the structure con- 
sistency. We highlight here two techniques used in our approach: selection 
constraints (cf. [Pulman96|) and coroutining (cf. [van Noord94 |Blache96|| ). 

| Pulman9l| represents ambiguities with fixed-arity lists. They are controlled 
by an agreement-like device between two categories, one being controlled by the 
other. These lists can be interpreted as disjunctions (not expanded into a nor- 
mal form) and their evaluation requires contextual information. The problem is 
that this mechanism can only be represented by a phrase-structure rules, and not 
at a general level. This is problematic in two respects: it is a context-sensitive 
mechanism and it only works for formalisms using phrase-structure rules (unlike 
HPSG for example). Moreover, this approach involves the definition of a new 
kind of categorial relations (cf. in Pulman's paper the relation between prepo- 
sitions and nouns) which have no linguistic motivation. Finally, the design of 
several control relations can rapidly become an exhaustive description of all the 
dependencies. 



A coroutining approach applied to named disjunctions is described in [ Blachc96| . 
The idea is to propose the on-line use of NDs relying on (actual) delayed eval- 
uation. The advantage is a direct interpretation of NDs: this representation of 
ambiguity doesn't use any pre-computation, there are no normal form expansion 
and the same mechanism is reused for all kind of ambiguities. In this approach, 



NDs are no longer simple "macros" as described in [ Brcdekamp96 : there is no 



interpretation of this formalisation into another one and the evaluation is direct. 
This is possible because the factorisation allows the construction of an under- 
specified structure which can be used during the parse. The problem however 
concerns the disambiguation propagation. More precisely, the only control is 
that of the covariancy which doesn't adress the problem of embedded NDs. 

An efficient treatment of ambiguity would need a factorised representation 
allowing the use of underspecification together with a fine specification of disam- 
biguating relations. These characteristics are fundamental for an implementa- 
tion avoiding DNF expansion and delaying disambiguation as much as possible. 



3.2 Controlled Disjunctions 

The main interest with NDs is the relation between different disjunctive formulae 
implementing the disambiguation propagation. This propagation supposes an 
equivalence relation in the sense that any value belonging to a ND can be 
instantiated and propagates the information to the rest of the ND. But this 
cannot tackle the problem of directed disambiguation relations as shown in 
example (^). In this example, we can say that (i) if the word is a noun, then 
the gender is masculine and (it) if the word is feminine, then it is an adjective. 
But the reverse is not true: we cannot deduce anything if the masculine value 



is instantiated. 
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We propose in the following the introduction of the notion of controlled 
disjunctions. They allow an integration of all the control relations relying on the 
distinction between controlling and controlled values within the ND framework. 

The first problem being that of covariancy, we identify two kind of disjunc- 
tions: 

• Simple disjunctions: their values can be controlling, controlled or both. 
They don't belong to a set of disjunctive formulae as for classical NDs but 
are represented in a separate ND which has its own name and contains 
this unique formula. This is the example case with the gender value of 
examples (l.c) or (l.e). 

• Covariant disjunctions: their values are necessarily both controlling and 
controlled. These formulae are equivalent to classical NDs. 

Insofar as covariancy is not the only relation between features, the second 
problem is the representation of control relations. In a covariant relation, all 
values occuring at the same rank of a formula are both controlling and con- 
trolled. Conversely, the description of non-covariant relations (involving simple 
disjunctions) relies on the distinction between controlling and controlled val- 
ues. In example (l.c), the noun value controls (at least) the masculine, and the 
feminine controls the adjective. 

A general solution consists of specifying explicitly all the non-covariant con- 
trol relations. As said before, controlled disjunctions (hereafter CD) are a ND 
extension in the sense that they are named and their formulae are ordered. So, 
any value can be referenced by the name of the disjunction and its rank in the 
formula. In the case of a simple CD, these references specify a unique value 
whereas for a covariant CD, they specify a set of covariant values. 

We note this control relation as: value^ ^ where value is the name of the 
controlling value, i is the name of the CD and j is the rank of the controlled 
value in the formula. 

Figure (|J) is a representation in an HPSG notation of the example (l.c). 
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This structure contains two controlled disjunctions indexed by 1 and 2 . 
Disjunction 1 is covariant and the instantiation of one of its value entails that 
of the other values of the same rank belonging to the same CD. Disjunction 2 
is simple and specifies two possible values. The control relations are specified 
by the controlling values. These values can either belong to a covariant or a 
simple disjunction. In this example, the controlling values are noun ^ 2 and 

fern . they control respectively the 1st value of the 2nd disjunction (i.e. 

masc) and the 2nd value of the 1st disjunction (i.e. adj and all its covariant 
values). 

We can remark that, in the case of typed structures, the elements belonging 
to a control relation can either be feature values or types: this is the case of the 
first disjunction which concerns the head types noun and adj. This property 
can be very useful for the expression of high-level ambiguities. 

The example of figure (JsJ) shows a more complex case with embedded con- 
trolled disjunction. But we can observe that the mechanism is the same. 
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In another way, simple disjunctions, in addition to their controlling capaci- 
ties, can be assimilate to finite domains contraints (in the constraint program- 
ming sense). Such constraint are very useful for several reasons: they constitute 
an efficient control on the unification process. Moreover, if no disambiguation 
can be applied, they propose an approximation of the solution. 



4 Delayed Evaluation 

Delayed evaluation techniques allow us to maintain the disjunctive structure 
until disambiguating information is found. This has several advantages. First 
of all, there are no arbitrary choices as with classical disjunctive approaches 
relying on normal forms: all instantiations are guided by contextual informa- 
tions. Moreover, the disambiguation propagation of CD allows us to trigger the 
process at any level: in the case of HPSG, the disambiguation can come from 
principles, schemas or even at the lexical level with structure sharing. 

We propose in this paper an approach using LIFE, a multi-paradigm program- 



ming language (cf. [ A'it-Kaci94|) integrating in particular functionnal facilities 
to a logic programming framework. 

A direct implementation of the CD presents two problems: the different 
kind of relations (covariancy and control) and the control of delaying devices. 



We propose here a method relying on functions which residuate if their argu- 
ments are insufficiently specified. As soon as these arguments are instantiated, 
their evaluation is fired. This mechanism is very similar to the delayed ones 
in Prolog (freeze, block, etc.) and correspond in some way to a constraint 
programming approach. 

We distinguish for the implementation three kind of relations implemented 
by three different functions. These functions are associated with the values 
involved in the CD. Two functions implements the controlling and controlled 
relations and one describes the directed (non-covariant) selection. 

The representation used for this relies on the following points: 

• disjunctive formulae are represented with lists, 

• the name of the CD is represented by an integer variable /, 

• / also represents the rank of the instantiated disjunct in the list. 

The residuation depends on (i) the feature value or (it) the rank of the 
choosen value in the list. When a feature value can be disambiguated, its 
instantiation in the list entails the instantiation of its rank. The other values 
controlled by the same name (i.e. belonging to the same CD) are controlled by 
a function residuating on the integer variable representing this rank. So, when 
a value is choosen, all the values of the same rank in the disjunction are also 
instantiated. The mechanism is the same with directed selections. In this case, 
we need a particular function binding two specific values which don't belong 
to the same CD. These relations are expressed again with the integer variable. 
To summarize (i) if a value belonging to a CD is instantiated, then its rank 
become known and (ii) if the rank is known, then the value can be instantiated. 
Let us describe more precisely these functions. In the following, the function 
cond(a,b,c) means that if a is true, then b, otherwise c. 

• Function controlling : controiiing(i,A,L) — > 

controlling_f ollowedd , I , A,L) . 
controlling.f ollowed(J,I,A, [X|L] ) — > 

cond(A:==X, A | I=J, controlling.! ollowed( J+l , I , A,L) ) . 

I is an integer representing the rank, A the feature value to be instantiated 
and L the list of possible values. This function rcsiduates on A. If A is 
known, its evaluation is fired, the function calculates the position of the 
corresponding value in the list L and returns the value itself. 

• Function controlled : controlled ( i, [x I l]) — > 

cond(I>0, 

cond(I=:=l,X,controlled(I-l,L)) , 
false) . 

The two arguments are the rank (variable I) and the list of values ( [X I L] ). 
The function residuates on I. When I is known, then the function returns 
the value corresponding to this position in the list. 



• Function selection : selection (i,x,y) — > 

cond(I>0, 

cond(I=:=X,Y,true) , 
false) . 

I represents the position of a controlling value (and the name of the cor- 
responding CD), X represents the value that I must have to select the 
controlled value and Y represents the rank of the controlled value to be 
instantiated. This function residuates on I. It returns the position of the 
controlled value in case the controlling one has the right value. 

Let us present now the implementation of the lexical entry of the figure (|J) . 

word (mobile, A) :- 

A.phon=mobile , 

A. synsem. loc . cat .head= H , —^j Lists 

A. synsem. loc . cat .valence . spr= S 

A . synsem . loc . content . index . gender= G 

H=controlling(I,H, [adj , noun]), 
(7) S=controlling(I,S, [[] , [Det]]), 

G=controlling(J,G, [masc, fern ]) , ^ Control features 

H=controlled(I , [adj ,noun] ) , 

S=controlled(I, [[] , [Det]] ) , 
Disjunction G=controlled(J, [masc, fern] ) , 

names^— I =selection( J, 2, 1) , 

J =selection(1 , 1 ,2) . 

This lexical entry bears two CDs: a covariant one controlled by I and a 
simple one controlled by J. In this example, all the disjunctive formulae contain 
values which can be controlled or controlling (this is necessarily the case for 
covariant CD, but not for the simple ones). So the three corresponding lists 
are in the scope of both the controlling and controlled funtions. Moreover, the 
particular directed relations between noun/masc and fem/adj arc implemented 
by the selection function which indicates for example in its first occurrence that 
if the second value of the CD J is instantiated, then the CD I must instantiate 
the first value (i.e. if 3=2, then 1=1). 



5 Conclusion 

The controlled disjunction technique presented in this paper allows a concise 
representation of disjunctive information, the description of precise variation 
phenomena (not only covariancy as with ND) together with an efficient imple- 
mentation relying on underspecification and delayed evaluation. It consitutcs 
an important extension of the named disjunction device: its representation of 
control phenoma between disjunctive values allow the implementation of all kind 
of ambiguities. 

This approach can be improved in several respects and in particular with 
consistency tests: verification that disjunctions are exclusive, no cycles in the 
dependency net of relations between disjunctive values, etc. 



The implementation of controlled disjunctions is direct using classical de- 
layed evaluation techniques. The example proposed in this paper relies on func- 
tional programming, but other logic programming techniques can also be used. 
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